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Abstract. We consider a natural generalization of the Partial Vertex 
Cover problem. Here an instance consists of a graph G — (V, E), a cost 
function c : V ^ Z"*", a partition Pi, . . . , P,- of the edge set E, and a 
parameter ki for each partition Pi. The goal is to find a minimum cost 
set of vertices which cover at least ki edges from the partition Pi. We 
call this the Partition-VC problem. In this paper, we give matching 
upper and lower bound on the approximability of this problem. Our 
algorithm is based on a novel LP relaxation for this problem. This LP 
relaxation is obtained by adding knapsack cover inequalities to a natural 
LP relaxation of the problem. We show that this LP has integrality gap 
of O(logr), where r is the number of sets in the partition of the edge set. 
We also extend our result to more general settings. 



1 Introduction 

The Vertex Cover problem is one of the most fundamental NP-hard problems 
and has been widely studied in the context of approximation algorithms 21, lOj. 
In this problem, we are given an undirected graph G — {V, E) and a cost function 
c : V ^ Z+ . The goal is to find a minimum cost set of vertices which cover all 
the edges in E' : a set of vertices S covers an edge e if S' contains at least one 
of the end- poin ts of e. Several 2- approximation algorithms are known for this 
problem [H. Il6|. The Partial Vertex Cover problem is a generalization of the 
Vertex Cover problem, where we are also given a parameter k. The goal is to 
find a minimum cost set of vertices which cover at least k edges. This problem 
was proposed by Bshouty and Burroughs and they gave a 2-approximation 
for this problem using LP-rounding. Since then, many different techniques have 



been shown to give 2-approximation algorithm for this problem ( jlTl \% 

In this paper, we consider a natural generalization of the Partial Vertex 
Cover problem. Here an instance consists of a graph G — {V,E), a. cost function 
c:V ^ Z+, a partition Pi, . . . , P,. of the edge set E, and a parameter ki for each 
partition Pi. The goal is to find a minimum cost set of vertices which cover at least 
ki edges from the partition Pi. We call this the Partition-VC problem. In this 
paper, we give matching upper and lower bound on the approximability of this 
problem. We give an 0(log r)-approximation algorithm for the Partition-VC 
problem, and show that unless P=NP, we cannot do better. Recall that r denotes 



the number of sets in the partition of the edge set E. Note that for constant 
number of partitions this gives a constant approximation. Our resuhs also extend 
to the slightly more general problem where edges have weights, and we would 
like to pick a minimum cost set of vertices which cover edges of total weight 
at least 77^ for each partition Pi. We call this the Knapsack Partition Vertex 
Cover problem. 

Our Techniques: The hardness result for the Partition-VC problem follows 
by an approximation preserving reduction from the Set Cover problem. The ap- 
proximation algorithm uses a novel LP relaxation - this is the main contribution 
of the paper, and we expect this idea to have more applications. The natural 
LP relaxation for even the Partial Vertex Cover turns out to be unbounded. 
Indeed, consider the following example : the graph is a star - there is a vertex 
V of degree D and all its neighbors are leaves. All vertex costs are 1, and the 
parameter fc = 1. Clearly, any optimal solution must cost at least 1 unit. But a 
fractional solution will pick the vertex v to an extent of , and hence will cover 
all the D edges fractionally to an extent of So, the fractional solution pays 
only . One way of getting around this problem is to augment the LP with more 
information. Here, we can guess the most expensive vertex an optimal solution 
will pick, and can remove all vertices with cost more than the cost of this vertex. 
Further, the cost of this vertex is also a lower bound on the optimal value. This 
idea was used by [l3| to give a 2-approximation for the Partial Vertex Cover 
problem. However, applying such an idea to the Partition-VC problem turns 
out to be non-trivial. We cannot guess the most expensive vertex in each of the 
partitions - this will take time exponential in r. Our approach is to strengthen 
the natural LP relaxation such that no guesswork is required. We show how to do 
this using knapsack cover inequalities [7|]. Armed with this stronger relaxation, 
we show that one can carefully use randomized rounding based techniques to get 
the approximation algorithm. 

Related Work: There has been recent work on partial covering versions of 
several covering problems. For the set cover problem, the partial covering version 
namely the partial set cover problem was first studied by Kearns [3| , who proved 
that the approximation ratio of the greedy algorithm is at most 2H(n) + 3, where 
n is the size of the ground set and H{n) is the n*'' harmonic number. Later 
Slavik [20*1 showed that it is actually bounded by H{k) where k is the number 
of elements to be covered. This natural greedy approach when extended for the 
Partition-VC problem gives only a iJ(|l^|)-approximation, which is much worse 
than the lower bound of O(logr) that we have proved for this problem. 

The partial vertex cover problem has also been widely studied in the lit- 
erature. Bshouty and Burroughs Q were the first to give a polynomial time 
2-approximation algorithm for it. Subsequently, several other algorithms based 
on Lagrangian Relaxation, local-ratio, prim al-dual techniques with the same ap- 
proximation guarantee were proposed (ITI [3, [13, Mestre's [3| primal-dual 
technique can also be used to get a 2-approximation for a more general version of 
the problem, the partial capacitated vertex cover problem. Bar- Yehuda et al. 0] 
gave constant factor approximation algorithms for several variants of this prob- 



lem using the local-ratio technique. Partial versions have also been studied for 
the Facility Location problem and its variants. Charikar et al. [Sj] explored the 
outlier or robust version of the uncapacitated facility location problem and k- 
center problem where only a fraction of the clients need to be serviced. For both 
the problems they gave constant approximation algorithms. Apart from these 
there are several other partial covering problems: e.g. k- median with outliers [3, 
k-MST problem [l^l and k-multicut problem 15]. However, these approaches do 
not seem to work for the Partition-VC problem. 

The set of partitions of the edge set in the Partition-VC problem are a 
special case of matroids. There has been significant work on maximizing a sub- 
modular function under matroid constraints 22, but none of these results 
apply to the Partition-VC problem. 

The rest of the paper is organized as follows. We present the hardness of the 
Partition-VC problem in Section[21 Our rounding algorithm and its analysis is 
presented in Section |31 Finally concluding remarks are made in Section [3) 



2 Hardness of the Partition-VC problem 

In this section, we prove that it is NP-hard to get better than 0(log r)-approximation 
for the Partition-VC problem. We give a reduction from the Set Cover prob- 
lem. Recall that an instance X of the set cover problem consists of a set X 
containing r elements, and a set S of subsets 5*1, ... , Sm of X. The goal is to 
find minimum number of sets in S such that their union is X. This problem is 
known to be NP-hard, and in fact, it is known that unless P=NP, any polyno- 
mial time algorithm for the Set Cover problem must have approximation ratio 
of /?(logr) jli]- 

We now describe the reduction in detail. Let X be an instance of the Set 
Cover problem as described above. We now construct an instance X' of the 
Partition-VC problem. The graph G in X' is a bipartite graph. The vertices 
on left side, are defined as follows : for each set Si G 5, we add a vertex s\ 
to V^. All these vertices have unit cost. The vertices on the right side, are 
as follows : for every element u G X, we have a corresponding vertex u' G V^. 
Each of these vertices has infinite cost. Now, we define the set of edges E' and 
the partition Pi , . . . , (note that the number of sets in the partition is same 
as the size of the set X in I). For a vertex s\ G and v! G V^, we have an 
edge between them in P' iff u G Si in the instance X. We partition the set P' 
as follows : for every v! G V^, define Pu' as the set of edges incident to v! . The 
partition of P' is {P„' : v! G V^}- Further, the quantities which tell how 
many of the edges in the set Pu' need to be covered, are 1. This completes the 
description of the instance X' . The following lemma is now easy to see. 

Lemma 1. There is a solution to X of cost C iff there is a solution to X' of cost 
C . Hence, unless P—NP, any polynomial time algorithm for the Partition-VC 
problem must have approximation ratio of f2 (log r). 



Proof. Any solution to I picks a some subsets Si^,. . . ,Si^ in S. Then we can 
have a solution of the same cost for I' in which we pick the corresponding 
vertices in V^. Similarly, consider a solution to X' . None of the vertices picked 
by this solution can be in (because of infinite cost). Thus, we can look at the 
corresponding subsets in <S, and these subsets will form a set cover in I. 



3 Approximation Algorithm for the Partition-VC problem 



In this section, we give an 0(log r)-approximation algorithm for this problem. 
We begin with the natural LP relaxation, and then strengthen it by adding 
knapsack cover inequalities. Fix an instance T consisting of a graph G = {V, E), 
partition Pi, . . . , P^. of E, and parameters ki, . . . ,kr. The natural LP relaxation 
is described below. For every vertex v gV, we have a variable Xy which should 
be 1 if we pick this vertex, and otherwise. For an edge e, we have a variable 
ye, which should be 1 if e gets covered by the solution, otherwise. 



min CyXy (LPl) 
vev 

Xu+ Xy > tje for all edges e = {u,v) G E 
Pe > ki for every partition Pj 

< Xj, < 1 for all vertices v <E V 
< j/e < 1 for all edges e e E 

As explained in the introduction, the integrality gap of this LP relaxation is 
unbounded even for the Partial Vertex Cover problem. 

To improve the integrality gap, we add knapsack cover inequalities as follows. 
Consider a subset of vertices A. Suppose we select all the vertices in A. Now, 
A will cover some of the edges in each of the partitions Pj. Define ki{A) as 
the number of edges we still need to cover from Pi (after having picked A). So, 
we must choose enough vertices from V \ A such that the remaining covering 
constraint in every partition is met. In other words, the following conditions 
must be satisfied 



Xu+Xv > Ve Ve = (u, v) G E and u,v ^ A 
2/e > ki{A) for every partition Pj 

e—{u,v)GPi 

If we replace the variable j/e in the second inequality above by using the first 
inequality, we get that for every partition Pj, 



^ {xu + Xv) > ki{A) 

e—{u,v)^Pi 

i.e., ^ degi{v,A)xy > ki{A), 

where deg^(t', A) denotes the degree of v in the subgraph of G considering edges 
in Pi only and removing the vertices of A. Since Xy have value lying the range 
[0, 1], we can further strengthen the above by truncating the values degj(u, A) to 
mm{deg^{v , A) , ki{A)) . Thus we get the following strengthened LP relaxation: 

min CyXy (PVC-LP) 
vev 

mm{ki{A),deg^{v,A))xy > ki{A) for all partitions P,, subsets A CV 

(1) 

< < 1 for all vertices v £V 

We shall show that the integrality gap of the above LP relaxation is 0(log r). 
We first present the rounding algorithm, and then we will discuss how to solve 
this LP. So assume that we have a solution x* to the LP above. The rounding 
algorithm is described below. 



Algorithm 1: Rounding a solution to the PVC-LP 
Given a solution x*. 

Let X be the integral solution that we will build. Initially, x„ = for all v. 
for \/v eV do 

if Xy > 1/6 then 
Set Xy ^ — 1 

else 

Set Xy <— 1 with probability 6a;*. 
end if 
end for 

Pick the set of vertices v in the solution for which af„ = 1 



Analysis of the Rounding Algorithm 

We begin by showing that the solution constructed above is good for any 
partition Pi with constant probability. 

Theorem 1. For any partition Pi, the solution x covers at least ki edges of Pi 
with probability at least 5/8. 



Proof. Define A to be the set {w S F : a;* > 1/6}. Note that our algorithm picks 
all the vertices in A. Therefore, we just need to show that the vertices picked 
from V\A cover at least ki {A) edges from Pi after we remove the vertices in A. 
Let l3i{v,A) denote '"'"(^^gi^('^^)''''(^)) _ Then the constraint (HI) applied to this 
particular set A implies that 

(2) ^AKAK>i. 

Lemma 2. For any partition Pi, 

Y,Pt{v,A)xv < 2, 

v^A 

happens with probability at most 3/8. 

Proof. The proof is a simple application of Chebychev's inequality and uses the 
fact that the quantities /3i{v,A) are at most 1. For a vertex v ^ A, let be an 
indicator random variable which is 1 if w is included in the solution (i.e., Xy = 1), 
otherwise. Let Zi denote ^^f^j^ I3i{v, A)Yj,. The expected value E[Zi] can be 
expressed as 

E[Z,] = ^6/3,(w,AX > 6, 

using the inequality ([2]). We now bound the variance Var{Zi) of Zi. 
Var[Z,] = Y,f3r{v,A)^Var{Y,) 

v^A 

= ^AKA)2.6<(l-6x:) 

v^A 

< 6 ^ A)xl because ^(t;. A) < 1 

v^A 

The claim now follows from Chebychev's inequality. Indeed, we want to bound 
the probability Pr[.Zi < 2]. This can be done as follows : 

Pr[Z, < 2] < Pr[Z, < E[Z,]/3] < Pt[\Z, - E[Z,]\ > MM] 

^ 9 Var[Z,] 
- 4 ■ E[Z,]^ 

~ ^' {Ey^A^P^iv,A)xty - 8' 

where the last inequality uses ([2]). 

Now, suppose the solution x does not cover at least ki edges of Pi. Then, 
restricted to the sub-graph of G where we include edges in Pi only, and remove 



all vertices in A, the total degree of the vertices picked by our algorithm (in 
V \A) will be less than 2ki{A). In other words, 



'^l3,{v,A)x^ < 2. 



But lemma [5] shows that the probability of this event is at most 3/8. This proves 
the theorem. 

Thus, in expectation, more than half of the partitions get satisfied. To satisfy 
all the partitions, we just repeat our algorithm 0(log r) times. So, our final 
algorithm is : repeat Algorithm [1] c log r times, where c is a large constant. We 
output the union of all the vertices chosen in each such round. The following 
theorem now shows that our algorithm is an 0(log r)-approximation algorithm. 

Theorem 2. With high probability, the algorithm outputs a feasible solution and 
its cost is O(logr) • J2vev ^^^v- 

Proof. Lemma [5] shows that in any particular round, we cover at least ki edges 
of Pi with probability at least 5/8. So, the probability that we do not satisfy the 
constraint for Pi in any of the rounds is at most 1/r^ for some large constant c', 
and hence, by union bound, our algorithm outputs a feasible solution with high 
probability. Also, the expected cost of each round is at most 6^„gy c„x*. This 
proves the theorem. 

Solving the LP Relaxation Finally, we show how we can get a solution x* 
for the PVC-LP. We first guess the value of the optimal solution - call it A (we 
can always do this up to any constant precision by binary search). We convert 
the LP to a feasibility LP by removing the objective function, and adding a 
constraint 



Now, we use the ellipsoid method to solve the LP. Given a candidate solution 
X, we first check if it satisfies the above constraint - if not, we can just return 
this violated constraint. Otherwise, we define A = {v : > 1/6}. We check the 
constraint ([1]) for this set A, and again, if this is not satisfied, we can return this 
as a violated constraint. Now, notice that our rounding algorithm just requires 
the solution x* to satisfy these two inequalities, and we need not even check all 
the (exponentially many) constraints ([1]). 

3.1 Extensions 

We now show that our result can be extended to more general settings. 
The Knapsack Partition Vertex Cover problem : Recall that in this prob- 
lem, we have weights We associated with each edge e. Again given a partition 
Pi , . . . , Pr , and parameters Ui , we would like to pick a minimum cost subset of 
vertices such that they cover edges of cost at least Ui from the set Pi for each 




V 



i. Our algorithm and analysis extend in straightforward way to this setting as 
well. 

The sets Pi need not be disjoint : Our analysis does not require these sets 
to be disjoint. The same algorithm works here as well. Note that our hardness 
results holds in the stronger setting where we want these sets to be disjoint. 

4 Conclusion 

We have presented algorithms for the Part it ion- VC problem. For this problem 
using primal-dual schema similar to the one described by Tim Carnes & David 
Shmoys Q we can obtain an 0(/)-approximation algorithm, where / is the 
maximum number of edges in a partition Pi. The proof is quite straight forward. 
This result is analogous to the /-approximation result for the Set Cover prob- 
lem [ill . [l[ . It will be interesting to extend our techniques to partition versions of 
other partial covering problems. One natural related problem is the Partition 
Set Cover problem. The Partition Set Cover problem can be seen as a gen- 
eralization of the Partial Set Cover problem where Pi, . . . , forms partition 
of the element set, and the goal is to find a minimum cost sub-collection of sets 
such that atleast ki elements are covered from partition Pi . For that we can get 
a HQ2p. fci)-approximation by directly extending Slavik's 20] greedy approach, 
and unless P=NP we cannot do any better. 
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